Search Results
Everything WRONG with LLM Benchmarks (ft. MMLU)!!!
SmartGPT: Major Benchmark Broken - 89.0% on MMLU + Exam's Many Errors
AgentBench: NEW Benchmarking Tool CHANGES The LLM LEADERBOARD (Installation Tutorial)
[2024 Best AI Paper] From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in
MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning 15min
Explained: The conspiracy to make AI seem harder than it is! By Gustav Söderström
Merge LLMs to Make Best Performing AI Model
Latency 2023 Locknote by Brooke Jamieson - From zero to AI hero
Time Until Superintelligence: 1-2 Years, or 20? Something Doesn't Add Up
ചോദിച്ചാ തരും Airhostess Sri Lanka Airways #airhostess #srilanka #airways #flight #pilot #airport
EfficientML.ai Lecture 14 - LLM Post-Training (MIT 6.5940, Fall 2024, Zoom Recording)
MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning 22min